Sticking the landing: A simple reduced-variance gradient for ADVI

نویسندگان

  • Geoffrey Roeder
  • Yuhuai Wu
  • David Duvenaud
چکیده

Compared to the REINFORCE gradient estimator, the reparameterization trick usually gives lower-variance estimators. We propose a simple variant of the standard reparameterized gradient estimator for the evidence lower bound that has even lower variance under certain circumstances. Specifically, we decompose the derivative with respect to the variational parameters into two parts: a path derivative and the score function. Removing the second term produces an unbiased gradient estimator whose variance approaches zero as the approximate posterior approaches the exact posterior. We propose that the removed term has arbitrarily high variance when the variational posterior has a complex form, as when using adaptive posteriors such as given by normalizing flows or stochastic Hamiltonian inference. 1 Estimators of the Evidence Lower Bound Variational inference posits a family of distributions Q and attempts to find an approximate posterior qφ by optimizing the evidence lower bound (ELBO): L(φ) = Ez∼q[log p(x, z)− log qφ(z |x)] (ELBO) An unbiased approximation of the gradient of the ELBO allows stochastic gradient descent to scalably learn complex models. When the joint distribution p(x, z) can be evaluated by p(x|z) and p(z) separately, the ELBO can be written in the following three forms: L(φ) = Ez∼q[log p(x|z) + log p(z)− log qφ(z|x)] (1) = Ez∼q[log p(x|z) + log p(z))] +H[qφ] (2) = Ez∼q[log p(x|z)]−KL(qφ(z|x)||p(z)) (3) By sampling z ∼ q(z), equations (1), (2), and (3) can be used to construct Monte-Carlo estimates of the ELBO. When p(z) and q(z|x) are multivariate Gaussians, using (3) is appealing because it analytically integrates out terms that would otherwise have to be estimated by Monte Carlo. Intuitively, we might that using exact integrals wherever possible will give lower-variance estimators. Surprisingly, though, there are circumstances under which (1), which we call the full Monte Carlo estimator of the ELBO, has lower variance than the estimator based on (3) which calculates the KL divergence exactly. Specifically, when q(z|x) = p(z|x), i.e. the variational approximation is exact, then the variance of the full Monte Carlo estimator is exactly zero, since its value is a constant

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sticking the Landing: An Asymptotically Zero-Variance Gradient Estimator for Variational Inference

We propose a simple and general variant of the standard reparameterized gradient estimator for the variational evidence lower bound. Specifically, we remove a part of the total derivative with respect to the variational parameters that corresponds to the score function. Removing this term produces an unbiased gradient estimator whose variance approaches zero as the approximate posterior approac...

متن کامل

A Model for Runway Landing Flow and Capacity with Risk and Cost Benefit Factors

As the demand for the civil aviation has been growing for decades and the system becoming increasingly complex, the use of systems engineering and operations research tools have shown to be of further use in managing this system. In this study, we apply such tools in managing landing operations on runways (as the bottleneck and highly valuable resources of air transportation networks) to handle...

متن کامل

A Semi-Automated Algorithm for Segmentation of the Left Atrial Appendage Landing Zone: Application in Left Atrial Appendage Occlusion Procedures

Background: Mechanical occlusion of the Left atrial appendage (LAA) using a purpose-built device has emerged as an effective prophylactic treatment in patients with atrial fibrillation at risk of stroke and a contraindication for anticoagulation. A crucial step in procedural planning is the choice of the device size. This is currently based on the manual analysis of the “Device Landing Zone” fr...

متن کامل

Effect of colloidal Particles associated with the liquid bridge in sticking during drying in Superheated Steam

It is very important in the design of a drying system is to evaluate sticking behaviour of the materials goes under drying. A new approach to the sticking issue is applied in this study by carrying out a sticking test for the liquid associated with the materials under study. It was found that the liquid bridge is responsible of the initial sticking of the materials to the contact surface and th...

متن کامل

Mathematical Modeling of the Differential Sticking Coefficient of Clay Drilling Fluids

The main objective of this work is to propose a mathematical model for the differential sticking coefficient of clayey drilling fluids with a lubricant as an additive and evaluate the influence of differential pressure and lubricant content on filter cake thickness and permeability. Tests were carried out on fluids composed of water and 4.86% of active bentonite clay prepared in Hamilton Beach ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017